Accelerating Mobile Video: A 64-Bit SIMD Architecture for Handheld Applications
نویسندگان
چکیده
Providing quality mobile video applications in hand-held mobile devices requires increased computational capability. Using Single Instruction Multiple Data (SIMD) techniques to expose and accelerate the data parallelism inherent in video processing increases performance in handheld and wireless systems. The paper introduces a new 64-bit SIMD coprocessor of the Intel©R XScale©R microarchitecture which is optimized for low-power handheld applications. The architecture blends the SIMD media processing style with the capabilities of the XScale microarchitecture. This paper provides an overview of the architecture, its instruction set, programming model, the pipeline organization and functional units. The paper also describes how key features of architecture improve the performance of video applications as compared to a scalar implementation. The performance and power improvements based upon measured results are analyzed to show how the opportunities of power savings by reducing the frequency and voltage can be realized.
منابع مشابه
Accelerating Mobile Multimedia with the Intel® PXA27x Processor Family
Demand for mobile video applications is growing today in wireless handheld platforms. The Intel® PXA27x processor family has been designed to accelerate mobile multimedia and applications processing in a power efficient manner. The PXA27x processor is a highly integrated system on a chip including the Intel XScale® Microarchitecture with 64-bit Intel® Wireless MMXTM technology, 256KBytes of on-...
متن کاملRefining Instruction Set Architecture for High-Performance Multimedia Processing in Constrained Environments
Multimedia processing in software has been significantly accelerated by the addition of subword-parallel instructions to the instruction set architectures (ISAs) of modern microprocessors. While some of these multimedia instructions are simple and effective, others are very complex, requiring large, special-purpose functional units that are not practical for constrained environments such as han...
متن کاملFaster Set Intersection with SIMD instructions by Reducing Branch Mispredictions
Set intersection is one of the most important operations for many applications such as Web search engines or database management systems. This paper describes our new algorithm to efficiently find set intersections with sorted arrays on modern processors with SIMD instructions and high branch misprediction penalties. Our algorithm efficiently exploits SIMD instructions and can drastically reduc...
متن کاملPerformance Analysis of Intel's MMX and SSE: A Case Study
The MMX and SSE extensions of current Intel Pentium processors ooer a 4-way or 8-way SIMD parallelism to accelerate many vector or matrix applications. In this paper the performance of MMX and SSE for the implementation of neural networks is evaluated. It is shown that a speedup in the range from 1.3 to 9.8 for single neural operations and a total speedup of up to 4.1 for the simulation of a co...
متن کاملImplementing a DVB-T/H Receiver on a Software-Defined Radio Platform
Digital multimedia broadcasting is available in more and more countries with various forms. One of the most successful forms is Digital Video Broadcasting for Terrestrial (DVB-T), which has been deployed in most countries of the world for years. In order to bring the digital multimedia broadcasting services to battery-powered handheld receivers in a mobile environment, Digital Video Broadcastin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- VLSI Signal Processing
دوره 41 شماره
صفحات -
تاریخ انتشار 2005